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Abstract 

Formulas are provided for the cumulants and the moments of the time T back to the most 
recent common ancestor of the Kingman coalescent. It is shown that both the jth cumulant 
and the jth moment of T are linear combinations of the values C(2m), m G {0,..., [ji/2j}, 
of the Riemann zeta function C, with integer coefficients. The proof is based on a solution 
of a two-dimensional recursion with countably many initial values. A closely related strong 
convergence result for the tree length of the Kingman coalescent restricted to a sample 
of size n is derived. The results give reason to revisit the moments and central moments of 
the classical Gumbel distribution. 
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1 Introduction 

Kingman’s coalescent mm, the most important coalescent among the class of all exchangeable 
coalescents, is a continuous time Markov process 11 = (nt)t>o with state space V, the set of 
partitions of N := {1, 2,...}. If this process is in a state with b blocks, then by definition during 
each transition any two blocks merge together at rate 1. This process starts in the partition of 
N into singletons and reaches its absorbing state, the partition consisting of the single block N, 
in finite time almost surely. Recently there is much interest in certain functionals of coalescent 
processes (restricted to a sample of size n G N) such as the number of jumps, the absorption 
time, or the total tree length to mention a few of them. In this manuscript we provide some new 
results on the absorption time and the total tree length of the Kingman coalescent. These results 
give reason to revisit the classical Gumbel distribution. 

The article is organized as follows. In the following Section[5]results on functionals of the Kingman 
coalescent, such as the absorption time and the tree length, are provided. Section |3] is devoted 
to the classical Gumbel distribution. We recall well known results of the Gumbel distribution 
but also shed some new light in particular on the central moments of this distribution. Proofs 
are provided in Section 01 The article finishes with an appendix where the hrst central moments 
of the Gumbel distribution are given explicitly. The appendix furthermore provides the spectral 
decomposition of the transition matrix of a pure death process having distinct death rates. 
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2 Absorption time and tree length 

Kingman [5] studied the absorption time T of 11. In the biological context T is called the time 
back to the most recent common ancestor or the age of the most recent common ancestor. It is 
well known that T = infinite convolution of independent exponentially distributed 

random variables with parameter := k(k — I)/2. Using the inversion formula for Fourier 
transforms it is readily checked that T has a bounded and infinitely often differentiable density 
5 : R —>■ [0, oo) with respect to Lebesgue measure A on R. Kingman [51 p. 37, Eq. (5.9)] showed that 
g{t) = “ l)Afee“'^'=‘, t G (0, oo). It is furthermore known (Watterson [T51 p. 213], 

Tavare [T51 p. 132] that T has mean E(T) = 2 and variance Var(T) = 47r^/3 — 12 1.15947 

and, hence, second moment E(T^) = 47r^/3 — 8 Ri 5.15947. To the best of the authors knowledge 
higher moments and cumulants of T have not been derived so far. Proposition 12.11 and Theorem 
12.21 below provide full information on the cumulants and moments of T. 

Proposition 2.1 For the Kingman coalescent the absorption time T has cumulants 

oo 

.,{T) = w 


and moments 




fc =2 


\ (2m — l)f (2m + 1)-^ 


j e N. (2) 


In particular, Kj{T) ^ (j — 1)! and E(T'^) ~ 3j! as j —)• oo. Alternatively, 


nT^) = j! E 


2<k-i<---<kj 


Afci * • * Afc 


‘ =i!2> y: n ‘ 


2<ki<"-<kj 


h{ki - 1 )’ 
1 — 1 ' ' 


j € N. (3) 


Remarks. 1. For n G N let Tn denote the absorption time of the Kingman coalescent restricted 
to a sample of size n. The proof of Proposition 12.11 provided in Section S] shows that T„ —>■ T 
almost surely as n —>■ oo with convergence of all moments, which implies (see, for example, [5J 
Proposition 3.12]) the convergence T„ —>• T in for any p G (0,oo). Moreover, the sequence 
is uniformly integrable for any p G (0, oo). 

2. Let : R —>■ [0, oo) denote the density of T„. Using the inversion formula for Fourier transforms 
it is straightforward to establish the local convergence result supjgR|g„(t) — g{t)\ = C)(l/n). 
Scheffe’s theorem (see, for example, [U Theorem 16.12] implies that gn ^ g in L^, and, therefore, 
|P(T„ € B) — P(T G B)\ = I Jg dA — /g gdAj < / Ign — gj dA —>■ 0 for all Borel sets R C R. 

3. With some more effort it can be even verified that sup^gR \n{gn{t) — g(^t)) — 2g'{t)\ = 0{l/n). 
The proof of this result is again based on the inversion formula for Fourier transforms, however a 
bit technical and therefore omitted here. We will not use this advanced local convergence result 
in our further considerations. 


The following result shows that the cumulants Kj{T) and the moments E(T'^) of T are related to 
the Riemann zeta function More precisely, Hj{T) and E(T^) are both linear combinations of 
the zeta values ({2m), m G {0,..., [j/2j}, with integer coefficients. 
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Theorem 2.2 For all j € N, 


k,(t) = (-ir2‘*‘ ^ <“> 

m=0 

and 

E(T‘) = (-l)»j2»+‘ Z&m - 1) (l - C(2’n), (5) 

where denotes the zeta function. 

Remarks. 1. The coefficient in ® in front of ({2m) is integer, since 2-^+^(l — 1/2^™“^) = 
2^+1 _ 2 f- 2™+2 g 2 and (2j — 2m — 2)!/(j — 2m)! S Z for all j S N and all m € {0,..., [j'/2j}. 
2. Let denote the Bernoulli numbers defined recursively via Bq := 1 and B^ := 

— l/(n+ l)X]fc=o n e N. For instance, Bi = —1/2, and B 2 = 1/6. Since ({2m) = 

(— l)'"“^(27r)^™/(2(2m)!)i?2m is a rational multiple of tt ^™, Theorem 12.21 implies that Kj{T) 
and E(T1) are both polynomials of degree [//2J with rational coefficients evaluated at tt^. The 
following tables are easily computed using dU and ®. 


j 

Kj{T) in terms of /■(.) 

Kj{T) in terms of tt 

Kj{T) numerically 

1 

2 

2 

2.00000 

2 

8C(2) - 12 

1^2 _ 12 

1.15947 

3 

160 - 96C(2) 

160 - 167r2 

2.08633 

4 

192C(4) -b 1920C(2) - 3360 

-b 3207r2 - 3360 

6.07947 

5 

-7680C(4) - 53760C(2) -b 96768 

-^TT^ - 89607r2-b 96768 

24.10210 


Table 1: The first five cumulants of T. 


j 

E(T'^) in terms of ({.) 

E(T'^) in terms of n 

E(T'^) numerically 

1 

2 

2 

2.00000 

2 

8({2) - 8 

4^2 Q 

gTT 8 

5.15947 

3 

96 - 48C(2) 

96 — 87r^ 

17.04317 

4 

672C(4) -b 768C(2) - 1920 

-b 1287r2 - 1920 

70.63058 

5 

-20160C(4) - 19200C(2) -b 53760 

-2247r4 - 32007r2 -b 53760 

357.62953 


Table 2: The first five moments of T. 


The proofs provided in Section |4] rely on the fact that the jump chain of the block counting 
process of the Kingman coalescent is deterministic, which implies that T is an infinite convolution 
of exponentially distributed random variables. The proof of Theorem 12.21 is based on a solution 
of a two-dimensional recursion with an infinite number of initial values (see Lemma EH) . 

Our methods do not seem to be directly applicable to (absorption times of) other exchangeable 
coalescent processes, since the jump chain of the block counting process of a coalescent with 
multiple collisions is in general not deterministic. 

Our methods are partly useful to analyze further functionals of the Kingman n-coalescent (re¬ 
stricted to a sample of size n G N). As an example we provide detailed information on the tree 
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length Ln (the sum of the lengths of all branches of the n-coalescent tree) of the Kingman n- 
coalescent. Let G be a standard Gumbel distributed random variable with distribution function 
X !->■ exp(—exp(—x)), a; G R. It is well known (see, for example, [TH p. 22-23], [3] Lemma 7.1] or 
[U Lemma 2.21]) that Ln has the same distribution as the maximum of n — 1 independent and 
exponentially distributed random variables with parameter 1/2 and that Ln/2 — logn —>■ G in 
distribution as n —>■ oo. We verify the following stronger convergence result. 

Theorem 2.3 (Strong asymptotics of the tree length) For the Kingman coalescent, as 
n —>■ 00 , Gn ■= Ln/‘2, — logn —>■ G almost surely and in for any p G (0,oo), where G is 
standard Gumbel distributed. Moreover, the sequence (GO)„gN Is uniformly integrable for any 
p G (0,oo). 

Remarks. 1. Theorem 12.31 implies the convergence Kj{Gn) —>■ Kj{G) of all cumulants and the 
convergence E{Gf^) —>• E(G-’ ) of all moments, which is one of the starting points of the proof of 
Theorem l2.3l Formulas for the cumulants and the moments of are provided in (EH), (HZI) and 

(IMl)- 

2. For asymptotic results on the tree length for A-coalescents with A = /3(a, b) being the beta 
distribution with parameters a,b £ (0, oo) we refer the reader to [7] Theorem 1] for 0 < a < 1 
and b := 2 — a and to [31 Corollary 4.3 and Theorem 5.2] for a = & = 1 (Bolthausen-Sznitman 
coalescent). The asymptotics of for S-coalescents with dust is provided in (TUI Theorem 3]. 


3 The Gumbel distribution revisited 

Theorem 12.31 and its proof smooth the way to establish a result (Theorem 13.11) on the Gumbel 
distribution, also called the extreme value distribution of type 1. Recall that a standard Gumbel 
distributed random variable G has (see, for example, [SI p. 12, Eqs. (22.29) and (22.30)]) cumu¬ 
lants Ki = ki(G) = 7 (Euler’s constant) and Kj = Kj{G) = (—l)^4/(J-i)(l) = (j — 1)1 j > 2. 
Note that Hj ~ (j — 1)! as / —>- 00 . Moreover, G has moments 

poo 

mn := E(G”) = / (-logu)”e-“ dw = (-1)”F(")(1), n G Nq := {0,1, 2,...}. (6) 

Jo 

The integral in ([S]) is sometimes called the nth Euler-Mascheroni integral. In the analytic commu¬ 
nity its interpretation as the nth moment of the Gumbel distribution often remains unmentioned. 
It is well known that 7n„ ^ n! as n —^ 00 . The moments mi, m 2 ,... can be recursively computed 
via the relation between cumulants and moments 


mn = X! _ 1 ^ 7 m„_i-b (n - 1)! 

k^l ^ ^ 

in agreement with the recursion provided on top of p. 214 in the book of Boros and Moll [2]. A 
useful and well known formula for the moments is 


E 

k=2 


1 


(n — k)\ 


C{k)mn-k, n G N, (7) 


^ E n n G N, (8) 

TTG'Pn B^TT 

where the sum extends over all partitions tt of the set Vn of all partitions of {1,... ,n}, and 
the product has to be taken over all blocks B of tt. Since there exist n!/(ai! • • • a„!l!“i • • • n!“'‘) 
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partitions tt G Vn having aj blocks of size j, 1 < j < n, the above formula can be also written as 


= n! 


(11,... ,Cln i — 1 


ai,...,an 1—2 ^ 


n G N, 


(9) 


where the sum extends over all oi,..., On G No satisfying instance, 

mi = 7 , m 2 = 7 ^ + C(2), and m 3 = 7 ^ + 37 C( 2 ) + 2C(3). In particular, m„ is a polynomial of degree 
n in the variable x := (cci,..., Xn) '■= ( 7 , C(2), ■ • ■, C(''^)) with nonnegative integer coefficients. 
In the following we focus on the central moments m'^ := E((G — 7 )"), n G No, of the Gumbel 
distribution. Clearly, 


m„ 


= E((G-7 + 7r) = ^eNo, 


( 10 ) 


and 








II -7 

i! > —r;— = n!e ' 


i=o 


j! 


as n ^ 00 . As for the moments we obtain for the central moments the recursion 


” / — 1 \ ”1 

k—1 ^ ' k—2 ^ 


k)i 


C{k)r 


^n — k 5 


G N, 


with solution 

= E n''isi(G-7) = ^! E n^(^) > 

TTG'Pn BGtt a2,...,an i — 2 ^ 2 

where the last sum extends over all 02 ,..., a„ G No satisfying 

ticular, for all n > 2 , Tn'„ is a polynomial of degree [n/ 2 j in the variable (C(2),... ,C(’^)) with 
nonnegative integer coefficients. Theorem 13.11 below provides an alternative formula for m'^. In 
order to state the result we introduce the nonnegative integer coefficients 

/_Ip' 

dn ■■= ( 12 ) 

j=0 d- 

Note that dn is the number of derangements (hxed point free permutations) of n elements. 
Clearly, d„ = ndn-i + (—1)", n G N. For instance, do = 1, di = 0, ^2 = 1, da = 2, di = 9, 
ds = 44, and dg = 265. A typical element of Vi will be denoted by tt := {Bi,... ,Bi}, where 
I := | 7 r| G {1,..., i} is the number of blocks of the partition and the blocks Bi,... ,Bi of the 
partition are non-empty disjoint subsets of {!,..., i} satisfying i?i U • ■ • U B; = {1,..., i}. Note 
that the order of the blocks is unimportant. For a set B and indices rib, b G B, we use in the 
following the notation ub '■= ^beB 
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Theorem 3.1 (Alternative formula for the central moments) A standard Gumbel dis 
trihuted random variable G has central moments TOq = 1, m'l = 0 and 


< = E 


ni + --- + ni = n 


dm ''' dji^ 

m! 


i{ni,...,ni), n>2, 


(13) 


with coefficients dn defined in m and Siffii,... ,ni) given via 

1 


Si{ni,...,ni) := ^ 


all distinct 




1=1 


(14) 

■CiuBi) (15) 
(16) 




BG'TT 


where ( denotes the zeta function and |7r| the number of blocks of the partition tt. 

Remarks. 1. The proof of Theorem [3T] is based on the fact that the random variable 5'„ defined 
in (1261) converges to G — 7 as n —^ oo. For more information on this convergence we refer the 
reader to the proof of Theorem 12.31 

2. Clearly, Si(ni,..., Ui) is symmetric with respect to the entries ni,... ,ni. For instance, si(ni) = 
C(ni), S 2 (ni, n 2 ) = C(»t-i)C(»^ 2 ) - Cini + 712 ) and S3(ni, n 2 , ns) = Cini)C{n2)C{n3) - C(ni)C(n 2 + 
^ 3 ) — C(^2)C(^i + 113 ) — ((,n 3 )({ni + 712 ) + 2((ni + 712 + ns). A recursion for 5 ^( 771 , ... ,ni) is 
provided in (|2^ . Eqs. dig and dig are reminiscent of sieve formulas, but we have not been able 
to rigorously relate these equations with some known sieve formula. 

3. The values of the central moments m'^ for 0 < ti < 10 are listed in the appendix. The Tith 
raw moment 7 n„ of the Gumbel distribution is either obtained via ([ 8 |) or (j9|) , or from the central 
moments tti', 0 < j < n, via dig. The book of Srivastava and Choi [HI pp. 370-371] contains 
the values of (—l)" 77 i„ = r*^"^(l) for 1 < n < 10. 


4 Proofs 


Proof, (of Proposition 12.ip For 71 € N let denote the absorption time of the Kingman co- 
alescent restricted to a sample of size n. Clearly (see, for example, Kingman [5J Eq. (5.5)]), 

T„ = X)fc= 2 ''’fc! where T 2 ,T 3 ,... are independent random variables and Tk is exponentially dis¬ 
tributed with parameter Xk = k{k — l)/2, k > 2. Thus (see, for example, Ross [TTJ p. 309]), T„ 
has a hypoexponential distribution with density gffit) := X)fe =2 0'nkXke.~^'‘*, t > 0, where 


^nk • — 


n 

i =2 


A, 

Xj Xk 


(_l)fc(2fc-l)- 

(n 


n\{n — 1)! 
k)\{n + k — 1)! ’ 


2 < k < n. 


Alternatively, the density of T„ is obtained as follows. Let D = {Dt)t>o denote the block 
counting process of the Kingman coalescent restricted to a sample of size n. From Lemma 15.11 
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(spectral decomposition) provided in the appendix it follows that T„ has distribution function 


P(T„ < t) 


P(A = 1 ) = X!® 

k=l 






i = 2 


^3 

\j Afc 


1 



n 

tG[0,oo), 

k=2 


where the second last equality holds since Ai =0. Taking the derivative with respect to t it 
follows that T„ has density gn- In particular, Tn has moments 

pOO pOO ^ 

E{Ti) = / = Va„fe / t^Afce-^'=‘dt = j € N. (17) 

•^0 fe=2 -^0 k=2 K 

In the following it is shown, essentially by letting n —>■ oo in inn, that 


E(r^) 


^' 2 -^ \j ’ 


3 e N. 


(18) 


Eq. (IT^ holds for j = 1, since E(r) = 2. Assume now that j > 2. From Tn T = J2T=2 
almost surely as n ^ oo it follows by monotone convergence that the left hand side in (HZl) 
converges to the left hand side in (1181) as n —>■ oo. In order to see that the right hand side in (TTfl) 
converges to the right hand side in (IT8l) as n —>■ oo fix e > 0. Since j > 2 and Afc = k{k — l)/2, 
the series J2T=2i‘^^ ~ ^)l^k i® absolutely convergent. Thus, there exists a constant no = no(£) 
such that ~ ^)l^k ^ Noting that ank = (—l)^(2k — l)bnk with 


0 bnk • — 


n!(n — 1)! 


i(n — 1) • • • (n — fc + 1) 


(n — fc)!(n + fc — 1)! (n + k — l)(n + A: — 2) ■ • • n 
it follows for all n > uq that 


< 1 


ank ^{-l)H‘2k-^) 


^nk 

k=2 ^k k=2 


Ai 


< 


< 


< 


k=2 '^k k=n+l '^k 


^2k-l., ^ 2fc-l 

/c =2 '^k 


\3 

k—no-\-'i- '^k 


^2k-l.^ ,1 

/ . — — —|Onfe~l|+e £ 


k=2 ^k 


as n —>• OO, since bnk ^ 1 as n —>• oo for each fixed k G {2,3,...}. Since £ > 0 can be chosen 
arbitrarily small, it follows that the right hand side in (ED converges to the right hand side in 
ED- Thus, (I18|) is established. Distinguishing in (I18|) even k = 2m and odd k = 2m + 1 and 
summing over all m G N it follows that 


E(r^') = 


^ 1 / 4m-1 
^ ml I (2m — 1)-’ 

m—1 ^ ^ ' 


4m + 1 ^ 
(2m + 1)J ) ’ 


j gN, 
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and ([2]) is established. Let us now turn to the cumulants of T. Note that the exponential distri¬ 
bution with parameter A € (0,oo) has jth cumulant (j — 1)!/A'^ . From T„ = follows 

that Tn has cumulants 


n n i' ■ _ 1 M » 1 

^ ^ ^ ^ j G N. (19) 

fc =2 k=2 '^k k=2 ' ' 

We have verihed above that the moments of Tn converge to those of T, which implies that the 
cumulants of Tn converge to those of T. Letting n —>• oo in (IT^ yields ([T]). 

Note that Kj{T)/{j — 1)! = Y ^^=2 ^l^k ^ ~ ^ 3 as 

j oo, since A 2 = 1 and Afc > A 3 = 3 for all k > 3. 

It remains to verify the alternative formula ([3]) for the moments of T. For all j G N we have 

E(r^) = EUf^TkY) = 

00 00 .1 00 

= i: n EW") = i: n 

ki,...,kj=2m=2 2<ki<---<kj t lm=2 ^=2 

where, for m > 2, am denotes the number of indices being equal to m. Note that 

Sm =2 ~ T Since = am}-l'Kn i the above expression simplihes to 


E(r^') 


E 

‘2<k\<‘“<kj 



i: n 

2<ki<---<kj i=l 


1 

ki{ki - 1) ’ 


which is □ 

The proof of Theorem l2.2l is based on the following basic but fundamental lemma, which provides 
a solution for a certain two dimensional recursion with a countable number of initial values. 

Lemma 4.1 Let ai, &i, 02 , & 2 ) • ■ • G K. For z, j G N define Sij recursively via Sij := Si-ij — Sij-i, 
with initial values sok ■= o,k and Sko ■= bk, /c G N. Then 


Sij — 


k^l ^ 






z + j — fc — 1 

J -1 


bk, 


i,j gN. 


( 20 ) 


Proof, (of Lemma |4T|) Induction on i+j. Clearly, (l20l) holds for i+j = 2, since sn = sqi — sio = 
Oi — 61 . For the induction step from z -|- j — 1 to z + j (> 2) three cases are distinguished. 

Case 1: If j = 1, then z > 1 and, hence, by the recursion and by induction, s^i = Si_ip — s^o = 
(«! - EEi ^k) -bi = ai- Yl=i h, which is (gHl) for j = 1 . 

Case 2: If z = I, then j > 1 and, hence, by the recursion and by induction, sij = soj — sij-i = 
<^j - -I- {-iy~^bi) = Y,i=i{-iy~^ak + {-lybi, which is ([201) for z = 1. 







Case 3: If i,j>l then 


Sij — 

j 

= E(-i) 


k^l 


j-k(^+J-k-2 

i-2 


2-1 


a-k + (-l)^E 


fc=i 


i+ j-k-2 
3 - 1 


i-i 






k^l 




bk 

i + j-k-2 
J-2 






i+j — fc — 2^ ^ — k — 2 

i — 1 


i-2 

2-1 


Cik 


+{-iyb, + {-i)iJ2 


= E(-i) 


j_k(i+j-k-l 


i j — k — 2^ ^ fi-\-j — k — 2 

j - 1 


k^l 


i — 1 


J-2 
2 

«fc + (-i)^E 

fe=i 




i + j — /c — 1 

j - 1 


bk, 


which completes the induction. 


□ 


Before we come to the proof of Theorem 12.21 we provide a typical application of Lemma 14.11 
showing that this lemma can be used to determine the value of certain series. 

Example. For j € N we would like to determine the series — l)-^)- We proceed as 

follows. For i,j € No := {0,1, 2,...} with i+ j >2 define Sy := ~ !)'’)• Note that 

sii = 1, Sio = C(0 ~ 1; * > 2, and that sgj = C{j), j > 2. For all i,j e N with i + j > 3, 


= E 


E i-j-i/' 


1 1 


, I — Si-lJ Sij-I- 


/c* ^{k — iy ^ k{k — 1) ^{k — iy ^ \k — 1 k 

If we additionally define Sqi := 1 and sio := 0, then this recursion holds also for i = j = 1, so 
for all i,j S N. By Lemma [4.11 for all j G N, 






k=l 


k=l 


2j-k-l 
J - 1 


Sfeo 


= (-1)^ E ^)i^ok + sko) + i-iy j2 -’ll ^)i^>^o-sok). 

k = 2 \ / fc = l \ / 

fc even k odd 

Plugging in sok + Sko = 2C(fc) — 1 for even k and Sfco — sofc = — 1 for odd k we obtain the solution 


E 


k3{k - ly 


= S, 






k = 2 
k even 


2j-k- 1 
j - 1 


C(fc) 


= (- 1 )^+' 


2 j-l 

j 


um 


+ 2(-ir E 


m—1 


2j — 2m — 1 
3 - 1 


C(2m) 
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j e N, 


( 21 ) 


b72j 

= 2 (- 1 )^E 


m—0 


2j — 2m — 1 

j - 1 


C(2to), 


since C(0) = —1/2. For instance S 22 = 2C(2) — 3 = tt^/S — 3 ~ 0.28987 and S33 = 10 — 6/(2) = 
10 -TT^ « 0.13040. 


The following proof of Theorem 12.21 has much in common with the previous example, but a 
modified double sequence is used having in particular more involved initial values. 

Proof, (of Theorem 12.2|) The formula (|1]) for the cumulants of T follows directly from ([1]) by 
multiplying (j2ip with (j — 1)!2^. In order to verify the formula ([S]) for the moments of T define 
for i, j G No with i+ j >2 

" (_l)fe(2fc-l) 


-E' 




k^{k-iy 


By Proposition 12.11 E(Tl) = j\2^Sjj. Thus it remains to verify that 


B/2J 


= 2(-l)^ ^(2m-l) 1- 


m=0 


( 2 j - 2 m- 2 )! 


C(2m), j G N. (22) 


22 m-ly (^j _ 1)1 (j — 2m)! 

It is straightforward to check that sn = I, S 02 = 21og2 + /(2)/2, S 20 = 1 — 21og2 + /(2)/2, 
S.0 = I - 2(^1 - - 1) + (1 - ^)cw, * > 3, 


and 


50 , = 2(^1 - - 1) + (1 - ^)c(j), J > 3. 


For all i,j G N with i + j > 3 we have 

_ ^ (-l)'=(2/fc-I) 1 
- I^TITTi 


k^ ^{k — 1)1 ^ k(k — 1 ) 


E 


(-l)'=( 2 A:- 1 ) / 1 1 


k^ i(fc- 1)J 1 Vfc- 1 k 


1 ) — (23) 


If we additionally define Sqi := I and sio := 0, then the recursion (|23)) holds also for f = j = 1 , 
so for all G N. By Lemma r4.il 


h, _ 1 \ J / u _ 1 


'^33 






2j-k- 1 
3 - 1 


SfcO: j S N. 


Ordering with respect to even and odd k yields 

' " 2 j -k-1 

j 


k=2 
k even 


“ E ( j_i j(5ofe + sfeo)+ E 


fc=i 
k odd 


2j-k-1 
3 - 1 


(sfco — Sofe). 
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Plugging in sok + Sfco = 1 + 2(1 - 1/2'" ^)C(fc) for even k, sio - sqi = -1, and Sko - sofc = 
1 — 4(1 — l/2'"“^))^(fc — 1 ) for odd A; > 3, it follows that 





k odd 



k odd 



k even 



k even 


Thus, 

3 

(—l)^Sjj = Cj + 'y ( CjkCik)^ 

k = 2 
k even 

where Cj := = -(2j - 2)!/(j!(j - 1)!) and 


(24) 


Cjk 


:= 2 1 - 


1 


= 2 1 - 


= 2 1 - 


2/c-i 

1 

1 


2j-k-l\ _j2j-k-2 

j-1 y V j-1 


( 2 j-fc-l)! 


(j - !)!(/ -^)! 

( 2 j -k-2)\ 


- 2 - 


{2j-k-2)\ 


= 2 (fc-l) 1 - 


2'"-V(j-l)!(j-fc)! 

1 \ (2j-fc-2)! 


(( 2 j - fc - 1 ) - 2 (j - k)) 




Substituting k = 2m in (l24l) and noting that </(0) = —1/2 yields (l22l) . The proof is complete. □ 

In order to prepare the proof of Theorem 12.31 recall that a standard Gumbel distributed random 
variable G has (see, for example, [3 p. 12, Eqs. (22.29) and (22.30)]) cumulants ki(G) = 7 
(Euler’s constant) and Kj{G) = (—l)' 54 /(l-i)(l) = (j — l)!^(j), j > 2 . 


Proof, (of Theorem 12.31) We start similar as in the proof of Proposition 12.11 Clearly, L„ = 
X)fc =2 kTk = X)fc =2 where the random variables Xk := kru, k >2, are independent and Xk is 


11 
















exponentially distributed with parameter := Xk/k = {k— 1)/2, k >2. Thus, has cumulants 


n n / • _ -1 \| n -i 

= (j-l)'2^E7fc3W> (25) 

fe=2 fe=2 ^^k k=2 ^ ' 

For j = 1 we have ki(G'„) = ki(L„/ 2 - logn) = ki(L„)/ 2 - logn = J2k=2 l/(^ “ 1) “ log’^ 

7 = ki(G') as n —>• cxd. For j > 2 we have Kj(Gn) = Kj(Ln)/2^ = {j — G)\Yk =2 

{j — l)!C(j) = kij{G) as n ^ oo. Thus, we have convergence Kj(G„) ^ Hj{G) as n ^ oo of all 

cumulants, which implies the convergence E{G{^) —)• E(G-') as n —oo of all moments. 

For the convergence G„ := Tn/S — logn —i> G in distribution as n oo we refer the reader to [3l 
Lemma 7.1] and the references in the remark thereafter. In order to verify that the convergence 
G„ —>• G holds even almost surely define 


Sn := 


Ln — E(L„) 


E 

k^2 


Xk - E(Xfc) 


k=2 


Yk, n G N, 


(26) 


where Yk := {Xk — E(Xfe))/2, k > 2. Note that F 2 )U)--- are independent. From E(L„) = 
2 1/(^“ 1) = 2 logn + 27 + 0(l/n) we conclude that S'„ —>■ G —7 in distribution as n —5> 00 . 

It is well known (see, for example [3 Theorem 22.7]) that a sum 5'„ = X]fe =2 of independent 
random variables converges in distribution if and only if it converges almost surely. Thus we even 
have Sn ^ G — "f almost surely. Thus, Gn ^ G almost surely as n ^ 00 . By [ 6 l Proposition 
3.12] it follows that Gn ^ G in for any p G (0, 00 ), and the sequence (G())„gN is uniformly 
integrable for any p G (0, 00 ). Note that |]S'„ - (G - 7 )!]^ < II 7 + Sn - G„||p + 11G„ - GJJp = 
II 7 + logn — J2k=2 “ l)llp + 11^" ~ ^llp —^ 0 as n —>• 00 , so we also have S'n —)• G — 7 in 

LP for any p G (0, 00 ) and, hence, convergence E(5'j() ^ E((G — 7 )-’) as n —> 00 of all moments. 
Furthermore, (S'„)„gN is a martingale, but we did not use this property in the proof. □ 


Remark. In this remark formulas for the moments of the total tree length are provided. 
It is known (see, for example, [3] Lemma 7.1] and the remark thereafter) that Ln has the same 
distribution as the maximum of n — 1 independent and exponentially distributed random variables 
with parameter 1/2. In particular, L„ has distribution function P(Lr!, < i) = (1 —t > 0, 
and, hence, moments 


pOO pOO 

HK) = / >t)dt = / jL-\l - (1 - e-‘/ 2 )»-i) dt 

Jo Jo 

poc n—1 / 71—1 / «oo 


= E(-i: 


/c+l 




k J {k/2y ^ 




y 


j e N. 


(27) 


Alternatively, 


k=2 


E{Li) = J = ^ E{Xk,---Xk^ 

ki,...,kj—2 


j! 


^ E(A“^).-.E(A^) = ^ _A__e(A“=).-.E(X^), 

ki,...,kj—2 2<fci <n 


02 ! ■ • ■ a„! 
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where, for m S {2,...,n}, Um denotes the number of indices ki,... ,kj being equal to m. Since 
= am'/, the above expression simplifies to 


mi) 


j'- E 


1 

Mfei ■ ■ ■ fJ-kj 




E 





j G N. 


(28) 


Comparing (|27ll with (|28l) leads to the combinatorial identity 


n 1 

\ k ) 


I<fci<---<fc 3 <n-l 


Proof, (of Theorem I3.1|) Note first that an exponentially distributed random variable X with 
parameter a G (0,oo) has moments E(X”) = nl/a'^ and central moments E((X — E(X))"’)) = 
dn/d^ with dn defined in m, n G Nq. Consider the random variable defined via (1^51) . From 
the proof of Theorem 12.31 it is already known that S'tv ^ G — 7 almost surely as N ^ 00 with 
convergence of all moments. Moreover, for all n G No, 


E{S^) 


E((y2 + • • • + Yn)^ 


EE E 

i—1 2<.ki<.---<.ki<.N 

ni + .-.+r, 

n N 

T- T T 

all distinct 

n 

m E 

i=l "1 ", 

ni + .-.+„ 

n 

>^E/ E 


m! ■ ■ -nil 




E(Y/l^)---E(Y/‘;) 


all distinct - l-n^=n 

1 1 


N 


(** 

1 


ai + ---+ni = n 


1 X ^ ^ni ■ ' ■ 


E 

,. . . , fc.^ : 
all distinct 

N-1 

E 


ri, I . . . T7 I ^ ■ . ■ k'^' ’ 

,=n all distinct 



since di = 0. Letting —>■ 00 it follows that G has central moments (IT^ with Si(ni,... ,ni) 
defined via m- Obviously, the expressions in m and (HU) coincide. Thus, it remains to verify 
that Si{ni,... ,ni) can be expressed in terms of the zeta function via (1151) . We show this by 
induction on i G N. Clearly, m holds for i = 1, since si(ni) = C(^i) for all ni > 2. Concerning 
the induction step from 1,..., i to i + 1 (> 2) note first that, for alH G N and all ni,..., >2, 


Si+i(ni,..., rii+i) = 


E 

all distinct 




E 


. fc * + ^ 

" fei+ieN\{fei,...,fci} 


E ... k/ 

....fcjeN J- ^ 


fci 

all distinct 


1 


E r,"*+i 

fci+iSN ^*+1 


1 

r—1 


i 

(^1 ; ■ • ■ ; (^ 2+1 ) ^ ^ ( j^l ; • ■ • ; ^r— 1 5 “ 1 “ ^ 2+1 ; ^r + 1; • ■ • ; ■ 

r—1 


(29) 
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By induction we conclude that 


Si{ni, si{ni+i) 

i 

= E (l^il - 1)! • • • (l^'l - 1)! • • • C(^B,)C(n.+i) 

i=l 

I 

= E(-i)*”' E (\Bi\ -1)! • • • m+i\- 1)! CM- ■ ■ 

1—1 

= E(-ir+^-' E (|i?i|-l)!---(|i?/|-l)!C(nBj---C(nBj. (30) 

1=2 

B, = {i+1} 

Also by induction it is seen that 

I 

^ ( Sj (tT-i , . . . , TZj’ — i, Tlr “t“ ^i+ 1 1 Br+ 1 7 • ■ • 5 
r—1 

i i 

= E E(-l)*”' E (l^il - 1)! • • • (l^'l - 1)' C(nAj • • • C(nA,_ JC(nA,u{.+i}), 

r=l 1=1 {Ai,...,Ai}€Vi 

reA, 

where, in the last sum, we ordered without loss of generality the blocks Ai,..., A; of the partition 
such that the element r belongs to the last block A;. Reordering the sums on the right hand side 
yields 

i 

^ ( 'Si(ni, • ■ • , Tlr—li Br “1“ ^r+1; • ■ • ; Bi) 

r—1 

i i 

Rewriting this expression in terms of the blocks Bi := Ai,..., Bi-i := Ai_i and Bi := A;U{i+l} 
it follows that 

I 

'^s,{ni,...,nr-i,nr + ”^2 + 1 ; ^r+l5 ■ • ■ 5 ”^ 2 ) 
r—1 

i i 

The last sum (over r) consists of \Bi\ — 1 summands and these summands do not depend on r, 
which gives rise to a factor \Bi \ — 1 leading to 

i 

^ ^ • • • 1 ^r —15 ■ i ”^ 2 ) 

r—1 
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= E(-!)*■' E (si) 

1 = 1 

B^^{i + 1} for all k<={l .1} 

Subtracting dSU from (Isni) and recalling (l29ll it follows that 
i+l 

which completes the induction. Thus, m is established. □ 

5 Appendix 

Central moments of the Gumbel distribution 

For completeness we record the central moment of the Gumbel distribution for 0 < n < 10. 
Based on m we provide them in terms of the coefficients Si{ni,... ,ni), in terms of the zeta 
function, and numerically. The first central moments of the Gumbel distribution are mp = 1, 
m[ = 0, = si(2) = C(2) = 7rV6 Ri 1.64493, m'^ = 2si(3) = 2C(3) « 2.40411, = 

9si(4) + 3 s 2(2, 2) = 6C(4) + 3(^(2) = 37rV20 Rs 14.61136, m'^ = 44si(5) + 20s2(2, 3) = 24C(5) + 
20C(2)C(3) = 24C(5) + (10/3)7r2C(3) « 64.43235, 

m'e = 265si(6) + 135s2(2,4)+40s2(3,3) + 15s3(2,2,2) 

= 120C(6) + 90C(2)C(4) + 40C^(3) + 15(3(2) 

61 

= -7r6 + 40C^(3) Rs 406.87347, 

168 

m '7 = 1854si(7)+924 s2(2, 5)+630s2(3, 4)+ 21053 ( 2 , 2 , 3 ) 

= 720C(7) + 504C(2)C(5) + 420C(3)C(4) + 210C^(2)C(3) 

= 720C(7) + 847r2C(5) + y7r^C(3) « 2 815.13142, 

m's = 14833si(8) + 7420s2(2, 6)+4928s2(3, 5)+ 283552(4,4) 

+ 189053(2, 2,4) + 112053 ( 2 ,3, 3) + 105s4(2, 2, 2, 2) 

= 5040C(8) + 3360C(2)C(6) + 2688C(3)C(5) + 1260(^(4) 

+ 1260C^(2)C(4) + 1120C(2)C^(3) + 105C^(2) 

1961 ^60 

= ^7r8 + 2688C(3)C(5) + —712(2(3) R^ 22630.60731, 
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m'^ = 133496si(9) + 66744s2(2, 7) + 44520s2(3,6) + 49896s2(4,5) 

+ 16632s3(2, 2, 5) + 22680s3(2, 3,4) + 2240s3(3,3, 3) + 2520s4(2, 2, 2,3) 

= 40320C(9) + 25920C(2)C(7) + 20160C(3)C(6) + 18144C(4)C(5) 

+ 9072C2(2)C(5) + 15120C(2)C(3)C(4) + 2240(^(3) + 2520C^(2)C(3) 

= 40320C(9) + 43207r2C(7) + + 2240C^(3) + 6l7r®C(3) 

D 

« 203595.03670, and 

m'lo = 1334961 si(10) + 667485s2(2,8) + 444960s2(3,7) + 500850s2(4,6) 

+ 243936s 2(5, 5) + 166950s3(2,2,6) + 221760s3(2,3,5) + 127575s3(2,4,4) 

+ 75600s3(3, 3,4) + 28350s4(2, 2, 2,4) + 25200s4(2, 2,3, 3) + 945s5(2, 2, 2, 2, 2) 
= 362880C(10) + 226800C(2)C(8) + 172800C(3)C(7) + 151200C(4)C(6) 

+ 72576C^(5) + 75600C^(2)C(6) + 120960C(2)C(3)C(5) + 56700C(2)C^(4) 

+ 50400C^(3)C(4) + 18900C^(2)C(4) + 25200C^(2)C2(3) + 945C'^(2) 

4077 

= + 172800C(3)C(7) + 72576(^(5) + 201607r2C(3)C(5) + 12607r4C^(3) 

oo^ 

« 2036946.09776. 

A spectral decomposition 

Lemma 5.1 (Spectral decomposition of a pure death process) Let n e N and let X := 

{Xt)t>o be a pure death process on a probability space P) with state space {l,...,n} 

and pairwise distinct death rates di,... ,dn- Then, the transition probabilities Pij{t) := f‘(Xt = 
j I Xq = i) are given by 


= X!® i,j (32) 

k=j 

where the n x n matrices R = (rij) and L = (Uj) are defined via rij := Uj := 0 for i < j and 

n 

i=j+i •> i=j 

Proof. With some effort it can be checked that RL = I, where / = {Sij)i<ij<n denotes the nxn 
unit matrix. Let Q = {qij)i<ij<n denote the generator matrix of X, i.e. qa := —di, qi^i-i := di 
and qij = 0 otherwise. Furthermore, let D denote the diagonal matrix with entries dij := —di for 
i = j and dij := 0 otherwise. It is readily checked that RD = QR, and, hence, RDL = Q. The 
transition probabilities pij (t) of the process X are now obtained from the spectral decomposition 
P{t) := = R{e*^)L of the transition matrix P{t) as Pij{t) = X]fc=i ^~'^'°*">’iklkj = 

Y.l=je-'^’^^r,ulkj- □ 
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